Variational Bayesian speaker clustering
نویسندگان
چکیده
In this paper we explore the use of Variational Bayesian (VB) learning in unsupervised speaker clustering. VB learning is a relatively new learning technique that has the capacity of doing at the same time parameter learning and model selection. We tested this approach on the NIST 1996 HUB-4 evaluation test for speaker clustering when the speaker number is a priori known and when it has to be estimated. VB shows a higher accuracy in terms of average cluster purity and average speaker purity compared to the Maximum Likelihood solution.
منابع مشابه
Infinite models for speaker clustering
In this paper we propose the use of infinite models for the clustering of speakers. Speaker segmentation is obtained trough a Dirichlet Process Mixture (DPM) model which can be interpreted as a flexible model with an infinite a priori number of components. Learning is based on a Variational Bayesian approximation of the infinite sequence. DPM model is compared with fixed prior systems learned b...
متن کاملScoring unknown speaker clustering : VB vs. BIC
This paper aims at comparing the Bayesian Information Criterion and the Variational Bayesian approach for scoring unknown multiple speakerclustering. Variational Bayesian learning is a very effective method that allows parameter learning and model selection at the same time. The application we consider here consists in finding the optimal clustering in a conversation where the speaker number is...
متن کاملRegroupement bayesien variationnel des locuteurs
In this paper we explore the use of Variational Bayesian (VB) learning in unsupervised speaker clustering. VB learning is a relatively new learning technique that has the capacity of doing at the same time parameter learning and model selection. We run experiments on the NIST 1996 HUB-4 evaluation test for speaker clustering. Two cases are considered : the speaker number is a priori known and i...
متن کاملBayesian Approaches in Speech Recognition
This paper focuses on applications of Bayesian approaches to speech recognition. Bayesian approaches have been widely studied in statistics and machine learning fields, and one of the advantages of the Bayesian approaches is to improve generalization ability compared to maximum likelihood approaches. The effectiveness for speech recognition is shown experimentally in speaker adaptation tasks by...
متن کاملFully Bayesian speaker clustering based on hierarchically structured utterance-oriented Dirichlet process mixture model
We have proposed a novel speaker clustering method based on a hierarchically structured utterance-oriented Dirichlet process mixture model. In the proposed method, the number of speakers can be determined from the given data using a nonparametric Bayesian manner and intra-speaker variability is successfully handled by multi-scale mixture modeling. Experimental result showed that the proposed me...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004